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Weapon  target  assignment  is  a  weapon  assignment  problem  with  the  objective  of  minimiz¬ 
ing  the  expected  survival  value  of  targets.  This  problem  must  be  quickly  solved  on  the  bat¬ 
tlefield  (i.e.,  in  real-time).  Considering  the  combined  complexity  and  the  strict  time 
constraints,  a  fuzzy  decision  maker  is  proposed  as  an  alternative  to  aid  commanders  in 
deciding  on  proper  weapon  assignments.  The  concept  builds  a  fuzzy  decision  maker  for 
a  given  data  set  by  using  extended  grid  partitioning  based  on  a  sensitivity  analysis  of  input 
variables.  The  proposed  decision  maker  is  implemented  and  tested  with  several  randomly 
sampled  assignment  instances  on  realistic  scenarios.  In  addition,  a  larger-scale  scenario  is 
also  considered  to  demonstrate  the  scalability  of  the  approach.  The  results  show  that  the 
proposed  system  exhibits  satisfactory  performance  and  is  serviceable  on  the  battlefield. 

©  2013  Elsevier  Inc.  All  rights  reserved. 


1.  Introduction 

Weapon  Target  Assignment  (WTA)  refers  to  the  reactive  assignment  of  defensive  weapons  to  engage  or  counter  identified 
threats.  As  an  illustrative  example,  consider  the  air  defense  of  a  naval  battle  group.  The  assets  being  defended  are  aircraft 
carriers,  escort  warships  and  support  ships,  which  all  have  some  intrinsic  value  to  the  defense.  The  threats  are  enemy 
anti-ship  missiles  launched  from  submarines,  surface  ships  and  air  platforms.  The  weapons  are  any  measure  that  can  be  used 
to  eliminate  the  threats,  e.g„  guided  missiles,  fast  firing  guns,  etc.  The  problem  is  to  assign  the  proper  weapon  to  each  threat. 
The  primary  concern  is  minimizing  the  total  expected  lethality  of  the  enemy  missiles  with  a  limited  number  of  weapons.  This 
problem  is  known  as  WTA  in  Operations  Research;  interested  readers  can  refer  to  [8,13,15,23,26]  for  a  comprehensive  liter¬ 
ature  review  of  WTA. 

Weapons  are  assigned  to  threats  in  stages  based  on  the  observed  outcomes  of  previous  stages  [15],  There  is  limited  time 
to  observe,  decide  and  act;  the  problem  must  be  solved  in  a  short  period  of  time.  For  example,  a  naval  battle  group  can  detect 
an  enemy  anti-ship  missile  at  10  km  away;  this  allows  for  (at  most)  10  s  to  counter  the  anti-ship  missile.  It  is  therefore  cru¬ 
cial  to  act  as  close  to  real-time  as  possible,  provided  that  all  acts  are  consistent  with  the  mission  objectives  and  compliant 
with  platforms  and  environmental  constraints. 

Various  solvers  have  been  developed  for  WTA.  There  are  global  optimum  solvers,  such  as  cutting  plane  techniques  and 
branch  and  bound  algorithms.  Ahuja  et  al.  [1,2]  proposed  several  branch  and  bound  algorithms  with  various  bounding 
strategies  for  WTA.  In  addition,  several  exact  solvers  have  been  proposed  with  some  restrictive  conditions,  i.e.,  when 
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all  the  weapons  are  identical  [12,16]  or  when  targets  can  receive  at  most  one  weapon  [9],  There  are  also  several  global 
search  solvers,  such  as  genetic  algorithms  [3,10,20,36],  simulated  annealings  [7],  discrete  particle  swarm  optimizations 
[35],  permutation  and  tabu  search  heuristics  [32],  and  a  hybrid  algorithm  of  a  genetic  algorithm  with  an  ant  colony  opti¬ 
mization  [19], 

WTA  is  known  to  be  a  typical  Non-Polynomial  Complete  (NPC)  problem  [21]  such  that  any  enumeration-based  solver 
faces  exponential  computational  complexity  as  the  problem  size  increases.  In  other  words,  while  any  enumeration  method 
may  ensure  a  global  solution,  its  computational  requirement  grows  exponentially  as  the  number  of  targets  and/or  weapons 
grows.  Problems  on  the  battlefield  must  be  solved  quickly,  i.e„  close  to  real-time.  The  combinatorial  complexity  of  the  prob¬ 
lems  implies  that,  even  with  the  supercomputers  available  today,  optimal  solutions  cannot  be  obtained  close  to  real-time, 
leaving  insufficient  time  to  act.  However,  while  global  search  solvers  are  reported  to  be  efficient  to  some  extent,  none  of 
them  can  guarantee  optimal  results.  More  importantly,  they  cannot  guarantee  the  competence  of  a  result  obtained  after  a 
certain  run  time.  To  address  the  computational  burden  of  exact  solvers  and  the  search  times  of  global  search  solvers,  several 
sophisticated  search  algorithms  and  heuristics  have  been  recently  proposed.  Rosenberger  et  al.  [25]  and  Bogdanowicz  [5] 
applied  an  auction  algorithm  for  WTA.  Lee  [18]  presented  an  enhanced  large-scale  neighborhood  (VLSN)  search  algorithm 
for  WTA.  Xin  et  al.  [31  ]  proposed  a  rule-based  heuristic  using  the  domain  knowledge  of  WTA.  They  reported  a  very  efficient 
computation  time,  even  for  large-scale  WTA,  but  they  could  not  ensure  a  global  solution;  the  auction-based  solvers  only 
guarantee  optimality  in  one-to-one  assignment  cases.  However,  VLSN  search  algorithms  and  rule-based  heuristics  can  return 
a  feasible  solution  (a  solution  satisfying  all  constraints)  of  the  highest  possible  quality.  Interested  readers  can  refer  to  [6]  for  a 
general  overview  of  the  meta-heuristics  of  hard  problems.  However,  none  of  the  aforementioned  solvers  satisfies  optimality 
on  a  strict  time  constraint. 

Rule-based  expert  systems  are  increasingly  preferred  over  controlled  complex  real-time  systems.  The  complexity  and  the 
strict  time  constraint  suggest  a  rule-based  expert  system  as  an  alternative  solver.  Rule-based  mapping  is  known  to  be  effi¬ 
cient  in  computation  time  by  its  nature.  However,  rule-based  systems  may  appear  to  have  an  infeasible  number  of  rules  for 
such  complex  problems.  The  approximate  reasoning  theory  [33],  along  with  the  compositional  rule  of  inference  method,  pro¬ 
vides  a  useful  framework  for  constructing  a  rule-based  solution  with  a  reasonable  number  of  rules.  Observations  on  a  bat¬ 
tlefield  are  characterized  by  imprecision  and  uncertainty.  Based  on  the  problem  complexity,  the  strict  time  constraints,  the 
number  of  rules,  and  the  imprecision  and  uncertainty  of  observations,  we  propose  to  apply  the  approximate  reasoning  the¬ 
ory  to  construct  a  rule-based  solver  as  an  alternative  decision  aid  system  for  helping  battlefield  commanders  decide  on  prop¬ 
er  WTA.  In  our  previous  work  [27,28],  we  employed  the  idea  of  applying  the  approximate  reasoning  theory  for  WTA. 
However,  we  could  not  analyze  the  theoretical  aspects  and  the  results  were  insufficient.  In  this  work,  we  can  better  justify 
the  concept  by  including  an  analysis  of  the  approximation  properties. 

The  primary  goal  of  this  work  is  to  show  the  applicability  of  the  approximate  reasoning  theory  for  WTA.  The  challenge  is 
to  extract  assignment  rules  and  generate  a  rule-base.  We  utilize  a  grid  partitioning  approach  [14]  in  this  work  and  extend  it 
in  such  a  way  that  regions  having  a  large  effect  on  outputs  are  partitioned  into  smaller  grids.  To  show  the  principle  of  the 
approach,  we  concentrate  on  small-scale  WTA,  i.e.,  a  single  platform  perspective;  these  small-scale  problems  include  a  small 
number  of  weapons  and  targets.  For  example,  [4,32]  defined  single  platform  scenarios  with  only  five  weapons  and  three  tar¬ 
gets.  Simulated  results  showed  the  superiority  of  the  proposed  model  at  a  small-scale  WTA.  However,  WTA  can  be  consid¬ 
ered  from  several  different  perspectives  from  single  platform  combat  to  force  coordination  [24],  In  the  force  perspective,  the 
number  of  weapons  may  easily  reach  100  or  more.  Ahuja  et  al.  [1,2]  classified  such  force  perspective  problems  as  large-sized 
and  fairly  large-sized.  To  demonstrate  the  scalability  of  the  approach,  a  large-scale  scenario  is  also  considered  in  this  work. 
The  results  show  that  the  proposed  system  exhibits  satisfactory  performance  and  is  serviceable  on  a  battlefield. 

Section  2  introduces  a  formal  formulation  of  WTA.  Section  3  presents  the  fuzzy  decision  maker.  Section  4  discusses  the 
selection  of  the  grid  counts  and  the  grid  partitioning.  Section  5  analyzes  the  approximation  properties.  Section  6  presents  the 
simulation  results;  several  existing  solutions  were  also  employed  for  comparison.  The  proposed  decision  aid  system  outper¬ 
forms  its  competitors  on  all  test  problems.  Finally,  Section  7  concludes  the  paper. 

2.  Problem  formulation:  Weapon  Target  Assignment  (WTA) 

Research  on  WTA  dates  back  to  the  1950s,  where  WTA  modeling  issues  were  first  investigated  [11,22];  since  then,  the 
mathematical  models  have  significantly  improved  [1,2,20,21,35],  The  problem  is  formulated  as  a  non-linear  integer  pro¬ 
gramming  problem,  with  the  objective  of  minimizing  the  total  lethality  of  the  targets.  The  problem  variables  are  the  kill 
probabilities  of  the  weapon-target  pairs  and  the  lethality  values  of  the  targets.  The  kill  probability  is  the  weapon’s  probabil¬ 
ity  of  destroying  the  target  if  engaged.  It  depends  on  all  aspects  of  engagement,  such  as  the  type  of  weapon  and  the  type, 
state,  and  location  (range,  sector)  of  the  target.  The  lethality  value  indicates  how  lethal  the  target  is  to  the  asset  if  it  survives 
to  its  destination.  This  depends  on  which  asset  is  at  risk,  as  well  as  the  type,  location  (range,  sector),  velocity,  and  course  of 
the  target.  This  formulation  allows  for  a  preferential  defense  under  a  heavy  attack;  it  may  be  optimal  to  leave  the  targets 
aimed  at  low-value  assets  undefended  and  concentrate  on  destroying  the  targets  aimed  at  high-value  assets. 

Consider  the  assignment  problem  of  w  weapons  to  t  targets:  Let  py  e  [0, 1  ]  be  the  kill  probability  of  weapon  j  for  target  i 

and  Vi  e  [0, 1  ]  be  the  survival  value  of  target  i  where  i  =  1 , . . . ,  t  and  j  =  1 . w.  The  WTA  is  formulated  as  a  non-linear  integer 

programming  problem  as  follows: 
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minimize 


E4(1  ~Pa)dv 


(la) 


subject  to 


Vj  =  1, . . .  ,w 
d,j  e  {0, 1},  Vi  =  l,...,t 


(lb) 


(lc) 


:  V  J  —  1 . W 


where  dy  is  the  decision  value  indicating  if  weapon)  is  assigned  to  target  i.  In  the  above  formulation  (1),  we  minimize  the 
sum  of  the  weighted  cumulative  probability  of  threat  lethality  while  ensuring  that  a  weapon  can  be  assigned  to  (at  most)  one 
target.  We  may  consider  adding  additional  constraints,  such  as  a  lower  bound  on  the  reduction  in  target  lethality  value  or  a 
lower  and  upper  bound  on  the  number  of  weapons  assigned  to  a  target. 


3.  Fuzzy  Decision  Maker  (FDM) 

Consider  the  assignment  of  w  weapons  to  t  threats.  Let  us  then  collect  all  problem  variables,  i.e.,  vt  and  py  where  i  =  1 , . . . ,  t 
and  j  =  1 . w,  in  an  n  =  t  +  tw  dimensional  vector  as  follows: 

x=  (Xi,x2,...,x„)  (2a) 

=  (vu...,vt,pn,...,pn,...,ptw)el  (2b) 

where  I  is  the  set  of  all  input  vectors,  i.e.,  I  =  {(Xj,  x2 . xn)|x,-  e  [0, 1  ],  V  i  =  1 . n}.  Similarly,  let  us  also  collect  all  decision 

values,  i.e.,  dy,  where  i  =  1 . t  and)  =  1 . w,  in  an  m  =  tw  dimensional  vector  as  follows: 


y  =  (yi,y2,-,ym) 

=  (dn,  ■  ■  ■  ,dn,d12,  ■  ■  ■  ,dt2, . . .  ,dM)  e  0 


(3a) 

(3b) 


where  0  is  the  decision  vector  set,  i.e.,  0  =  {(yi,y2,  •  •  •  ,ym)|yi  6  {0,1},  V  i  =  1 . m}. 

The  decision  vector  set  is  actually  a  finite  set  with  c  =  2m  possible  assignment  decisions.  Thus,  we  may  rewrite  0  as 
0  =  [class i,  class2, ....  c/assj,  where  class  is  any  possible  assignment  decision.  For  example,  class i  =  (0,  0,  0, ... ,  0)1><m  means 
no  assignment,  class2  =  (1, 0,  0, . . . ,  0)i  xm  means  that  weapon)  =  1  is  assigned  to  only  threat  i  =  1  and  all  other  weapons  are 
not  assigned.  In  this  context,  the  WTA  can  be  modeled  as  a  classification  problem  where  the  input  pattern,  i.e.,  x  e  /,  is  as¬ 
signed  to  one  of  the  given  set  of  decisions,  i.e.,  y  e  0  =  [class i,  class2 c/assc}. 

The  discussion  above  shows  that  the  WTA  can  be  modeled  as  a  classification  problem.  Therefore,  a  single  fuzzy  classifier 
(e.g.,  a  Mamdani-type  classifier)  can  be  used  to  solve  the  problem.  However,  we  must  handle  a  considerable  number  of  clas¬ 
ses,  resulting  in  an  impractical  rule-base  size,  even  for  a  small-scale  assignment  problem.  For  example,  the  total  class  count 
would  be  c  =  225  =  33,554,432  for  five  weapons  and  five  targets.  An  obvious  solution  is  to  remodel  the  problem  as  a  union  of 
individual  classification  problems. 

Based  on  the  given  rationale,  we  divide  set  O  into  w  individual  subsets  as  O  =  Oi  U  . . .  U  Oj  u  . . .  u  Ow  such  that  each  holds 
the  possible  assignment  of  only  one  weapon.  That  is,  Oj  =  {classf,,  •  •  • ,  dassj, . . . ,  riassj  J  is  the  set  holding  the  possible  assign¬ 
ments  of  weapon)  =1,2 . w  where  class =  (0,0, . . . .  0),  xt  means  no  assignment  and  classli  =  (0,0, . . . ,  1, . .  .0)lxt  indicates 

that  weapon)  is  assigned  to  threat  i.  Thus,  the  FDM  can  be  built  as  a  union  of  w  individual  Mamdani-type  classifiers.  More¬ 
over,  the  cascaded  structure  shown  in  Fig.  1  can  reduce  the  number  of  inputs  for  each  classifier.  Therefore,  it  is  sufficient  to 
handle  only  t  +  1  classes  and  t  inputs  within  a  single  classifier,  obviously  decreasing  the  complexity. 

As  shown  in  Fig.  1,  the  FDM  consists  of  two  units:  the  Fuzzy  Transformation  Unit  (FTU)  and  the  Fuzzy  Classifier  Unit 
(FCU).  The  first  unit  (FTU)  approximates  the  data  set  to  an  intermediate  domain  where  assignment  decisions  are  not  neces¬ 
sarily  integers.  The  second  unit  (FCU)  classifies  the  FTU  output  to  a  valid  assignment  decision.  The  FTU  actually  performs  a 
non-linear  transformation  of  the  original  input  pattern  to  an  intermediate  pattern.  The  transformation  is  expected  to  in¬ 
crease  the  discrimination  performance  of  the  FCU,  leading  to  a  more  accurate  decision.  Moreover,  the  FCU  receives  a  subset 
of  the  intermediate  pattern  (instead  of  the  entire  pattern)  without  regard  to  informational  loss.  Therefore,  the  FCU  can  be 
constructed  with  a  less  complex  rule-base.  In  the  following,  we  proceed  with  the  details  of  each  unit. 

3.1.  Fuzzy  Transformation  Unit  (FTU) 

As  depicted  in  Fig.  1,  (2)  is  the  FTU  input.  The  goal  is  to  approximate  this  input  to  an  intermediate  pattern  denoted  by 

vector  u  =  (iii,  u2 . um).  Let  U  be  the  set  of  all  possible  intermediate  patterns.  That  is,  U  =  {(u i,  u2 . um)|u,-  e  [0,1];  V 

i=  1, . . . ,  m}.  Note  that  u,  does  not  necessarily  constitute  a  valid  decision. 
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y  =  (2/1, 2/2,-,  2/m) 


X  =  (xi,X2,-.,Xn) 

=  (v1,...,vt,p11,...,ptl,...,ptw)  e  I 

Fig.  1.  Structure  of  the  Fuzzy  Decision  Maker  (FDM). 


Suppose  that  the  domain  interval  of  each  input  variable  is  partitioned  into  h,  fuzzy  sets  such  as  Aj.  where  i  =  n  and 
ji  =  1 . Let  Aj,  j2  .  jn  be  the  fuzzy  set  defining  the  rule  premise,  i.e.,  Ajt  j2  Jn  =  A?  x  A?  x  . . .  x  A;" .  The  FTU  is  then  charac¬ 

terized  by  the  following  rule  set. 


RULE,-  j2, ...Jn  :  IF  x  e  Ahht...jn  THEN  u  is  Bjlj2  jn 

(4) 

where  ^  ^  e  U  is  the  rule  output.  Let  pA<  (.)  be  the  membership  function  corresponding  to  fuzzy  si 
then  evaluated  by  ' 

-t  Aj- .  The  Bj]  j2  jn  is 

^  (*)y 

'1J2,  Jn  J2{xy)eslJ-Ahj2  h  (*) 

where  (x,y)  is  any  data  tuple  in  the  training  data  set  S  and  fXA,  ,  (x)  =  n".  1  M- 

Once  the  rule-base  is  formed,  we  may  represent  the  FTU  as'a  fakagi-Sugeno'XTS)  model  [30], 

(5) 

Ej^enM^Jx) 

(6) 

where  H  =  U1J2,  ■  ■  ■  ,jn[?i  =  1,2 . h,;  j2  =  1, 2 . h2 _ ,jn  =1,2 . hn}.  Note  that  the  TS-type  fuzzy  system  does  not  re- 

quire  a  separate  defuzzifier. 

To  further  simplify  the  model  (6),  we  define 

.  ,  ,  ,4(x) 

(7) 

(8) 

Using  the  above  notations,  the  FTU  is  finally  expressed  as  follows: 

u  =  FTU(x) 

(9a) 

-  E  1..-JU 

(9b) 
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where 

%h,  j-  =  E  <pm 2,-jn^y 

(x,y)es 


(10) 


3.2.  Fuzzy  Classifier  Unit  (FCU) 


As  depicted  in  Fig.  1,  there  exist  w  FCUs.  To  reduce  the  rule-base  complexity,  we  relate  each  FCU  with  only  one  weapon. 
That  is,  each  FCU  classifies  (maps)  a  part  of  the  intermediate  pattern  (i.e.,  FTU  output)  to  an  assignment  decision  for  only  one 

weapon.  For  example,  u  ,  =  (Uy_i)t+i.  Uq-  ,)t+2 . u0-  ,)t+t)  c  U  is  the  input  of  the  FCU  related  to  weapon  j  where  j  =  1, . . . ,  w. 

Note  that  the  total  number  of  input  variables  is  only  t.  For  this  example,  the  possible  assignment  decisions  are 
Oj  —  |dassi,, . . . ,  class j, . . . ,  class)  j  c  0,  where  class j,  =  (0, 0, . . . ,  0)lxt  means  no  assignment  and  c/assj  —  (0, 0. ....  1 ... .  0)lxt 
indicates  that  weapon  j  is  assigned  to  target  i. 

Suppose  that  the  domain  interval  of  each  input  variable  is  partitioned  into  g,-  fuzzy  sets:  Cj(,  where  i  =  1 . m  and  j;  =  1,  - 

....  gj.  Thus,  for  weapon  j  =  1 . w,  the  FCU  is  characterized  by  the  following  rule  set: 

RULEy2,.„A  :  IF  Uj  e  Ckh„..Jt  THEN  yj  is  Djlj2  ...jt  (11) 

where  CjIifei...jf  is  the  fuzzy  set  defining  the  rule  premise.  That  is,  =  Cj^1*^1  x  Cj^1^2  x  . . .  x  Cj*.  Moreover,  Dj, j2  jt  is 

rule  output  and  is  the  common  decision  encountered  in  set 

=  {(y(M)t+i  .ys-i)t+2,---,yjt)l(u,y)  e  s',  u,  e  c^}  (12) 

where  S'  is  the  training  data  set  of  the  FCUs.  S'  is  obtained  from  the  training  data  set  S  as  follows: 

S'  =  {(fi,y>|(x,y>eS1u  =  FTU(x)}  (13) 

Once  the  rule-base  is  formed,  it  is  easy  to  obtain  the  corresponding  fuzzy  classifier.  Let  / 1 ^  (.)  be  the  membership  function 
corresponding  to  fuzzy  set  Cj..  The  rule  firing  strength  for  the  FCU  of  weapon;  =  1, ....  w,  is  then  computed  by  the  following 
equation: 


hj2,:jSui) 

where 


(14) 


(15) 


Gj  =  (OV-DW  J(j-  dm.  •  ■  ■  Jjt)  UVi}t+i  —3,2,...  ■..  .;jjt=  1,2...,  gJt}  (16) 

Once  the  firing  strengths  for  weapon  j  =  1 . w  are  determined,  the  output  of  the  FCU  is  evaluated  by 

Vj  DiiJi  j;  (17) 


fvfz,  ■  ■  ■  ,ft  =  arg  .max  jt(Uj)) 


(18) 


4.  Training 

To  apply  grid  partitioning,  we  must  properly  set  grid  counts  (fif  and  gj)  while  training  both  the  FTU  and  the  FCU.  Obvi¬ 
ously,  the  choice  of  grid  counts  has  a  significant  influence  on  the  performance  of  the  trained  rule-base.  If  the  count  is  too 
small,  the  rule-base  will  be  insufficient  for  modeling  non-linear  behavior.  However,  if  it  is  too  large,  the  corresponding 
rule-base  will  be  too  complex  to  implement. 

Suppose  that  we  have  an  upper  bound  on  the  rule  count  and  a  corresponding  upper  bound  on  the  number  of  grids.  The 
input  variables  that  have  a  high  influence  on  output  should  obviously  be  partitioned  with  the  smallest  possible  grids.  Based 
on  this  rationale,  the  rule-base  can  achieve  satisfactory  performance,  even  with  a  lower  bound  on  the  rule  count.  In  this  sec¬ 
tion,  we  propose  an  extended  grid  partitioning  methodology  that  uses  a  sensitivity  level  for  each  input  variable  to  set  grid 
counts. 
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4.1.  Sensitivity  analysis  of  input  variables 


Let  [y,,y2 . ym]  =Rx i,x2 . x„)  be  a  multi-input/multi-output  non-linear  function.  Let  {(x-i.y-i),  (x2,y2),  ■  •  • ,  (Xq,yq)}  be 

a  set  of  q  data  derived  from  function  /.  In  statistics,  a  non-linear  regression  method  suggests  approximating  a  non-linear 
function  with  a  linear  one  [29],  Thus,  we  may  approximate  the  derivative  of  the  input  variables  K  as  follows: 

AY  =  KAX  (19) 

where 


yi  -y2 

yi  -ys 


yi-yj 

yq  i-yq 


x,  -x2 
X,  -x3 


Xq  1  -X,. 

rses  where  there  is  no  exact  solution  of  (19),  we  may  approximate  If  by  following  the  pseudo-inverse  formula: 


k=(axtax)  Way  (22) 

Each  element  of  if  (19)  implies  an  approximate  rate  of  change  in  output  related  to  the  rate  of  change  in  input.  In  other 
words,  K  holds  the  sensitivity  level  for  each  input  variable.  Thus,  we  may  evaluate  the  sensitivity  level  for  any  input  variable 
x,-  where  i  =  1 . n  as  follows: 


,  II  fen  kq  ...  kim  II 

'  EMIfepi  fep2  km 


(23) 


where  [|-J|  is  the  L-norm  (L  =  2)  and  [  kn  ki2  ...  kim  ]  is  the  ith  row  of  the  matrix  K. 


4.2.  Rule  extraction  based  on  sensitivity  analysis 

Based  on  the  sensitivity  analysis  just  described,  the  sensitivity  level  for  each  input  variable  of  an  FTU  can  be  obtained,  i.e., 
/,•  for  each  x,  where  i  =  1, . . . ,  n.  We  first  form  the  input  variation  matrix  and  the  output  variation  matrix  on  the  training  data 
set  S  as  in  (20)  and  (21).  Once  we  determine  AX  and  AY,  we  can  easily  evaluate  the  sensitivity  level  by  (23)  for  each  input 
variable. 

Suppose  that  we  have  an  upper  bound  ( MaxRuleCount )  on  the  rule  count  for  the  FTU.  In  grid  partitioning,  the  rule  count  is 
equal  to  the  multiplication  of  the  grid  counts. 

RuleCount  =  ]>  (24) 

Using  the  result  of  the  sensitivity  analysis,  we  can  identify  grid  count  h,  as  follows: 

(25) 

for  each  x,  where  i  n. 

As  for  FCUs,  we  can  form  an  input  variation  matrix  for  the  training  data  set  S'  (see  (13))  as  follows: 
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U,  —  U2 
U!  —  U3 


AU  = 


(26) 


Uj-Uj 


|_Uq  ,  -UqJ 

Note  that  the  output  variation  matrix  AY  is  already  formed  above. 

Similar  to  the  identification  of  the  grid  count  for  the  FTU,  we  identify  the  grid  count  gt  for  the  FCU  as  follows: 


K 


MaxRuleCount 


(27) 


where  MaxRuleCount  is  the  upper  bound  on  the  total  rule  count  for  the  FCU  layer. 

By  adopting  the  described  methodology,  we  can  properly  set  grid  counts  for  a  given  upper  bound  on  a  total  rule  count. 
Moreover,  rule-bases  will  be  more  specific  for  input  variables,  thereby  having  a  significant  effect  on  output  yet  general  en¬ 
ough  to  cover  variations  for  all  input  variables.  Thus,  we  may  implement  more  accurate  fuzzy  decision  makers,  even  with 
limited  hardware  (i.e.,  a  bound  on  the  rule  count),  compared  to  standard  grid  partitioning.  Note  that  the  concept  approxi¬ 
mates  the  underlying  function  through  a  continuous  and  linear  function.  Obviously,  the  approximation  error  will  increase 
as  the  underlying  function  deviates  from  being  continuous  and  linear. 

5.  Approximation  analysis 

To  analyze  the  FDM  approximation  mechanism,  we  initially  introduce  the  following  definitions: 

Definition  1  ( Normal  Fuzzy  Set  [34]).  A  fuzzy  set  A  on  a  domain  U  is  said  to  be  a  normal  fuzzy  set  if  there  exists  at  least  one 
x  e  U  such  that  the  corresponding  membership  function  returns  1,  i.e.,  Pa(x)  =  1. 

Definition  2  ( Completeness  of  Partition  [34]).  Fuzzy  sets  A,  where  i  =  1 . h  form  a  complete  partition  on  a  domain  U  if  for 

any  xe  U  there  exists  at  least  one  fuzzy  set  with  juAj(x)  >  0. 

Definition  3  ( Consistency  of  Partition  [34]).  Fuzzy  sets  A,  where  i  =  1 _ _ h  form  a  consistent  partition  on  a  domain  U  if 

/4((x)  =  1  for  any  xeU  and  p.  (x)  =  0  for  all  j  #  i. 

Definition  4  (Fuzzy  Set  Support).  The  fuzzy  support  of  an  input  value  x  e  U  is  a  union  of  fuzzy  sets  A,-  of  which  support 
includes  the  given  inputs,  i.e.,  P(x)  —  Uxr/1|.A(. 

With  the  above  definitions,  the  approximation  mechanism  of  the  FTU  (9)  is  defined  by  the  following  theorem: 

Theorem  1  (Approximation  Mechanism  of  FTU).  Suppose  that  fuzzy  sets  A]  are  normal,  complete  and  consistent  on  training 

data  set  S  where  i  =  1 . n,  j)  =  1 . h,.  Note  that  ht  is  the  grid  count  for  the  ith  variable.  Then,  for  any  xel  and  corresponding 

optimal  assignment  decision  vector  y*  e  0,  the  approximation  error  is 


e=||y*-F7U(x)|| 

HI r-  E 


(28a) 

(28b) 


(28c) 

(28d) 


where  Ah  =  A?  xA?  x  ■  ■  ■  x  A,"  ,  P(x)  is  the  fuzzy  set  support  of  input  x  and  Fl(x)  =  {j,  ,j2 ,jn  |x  6  A,,  j2r..jn  ;j,  ,j2 , . . . ,  jn  e  H}. 
Note  that,  ||  ||  is  the  2-norm  or  the  Euclidean  length  of  the  input. 


Proof  1.  Because  AJlj2i ...Jn  are  complete  and  consistent  fuzzy  sets,  there  exists  at  least  one  Ajtj2„. ,j„  such  that  x  e  Aj 1j2,...jn, 
where  ji,j2 . jn  e  H.  Recall  that  H(x)  =  ■  ■  ■  ,j„|X  e  Ahh,-in'ji U2.  ■  •  •  Jn  e  H  Therefore, 

Jl  J2,  ■  ■  ■  Jn  6  H(X)  \b....Jn  *=*  hj2,-Jn(X)  >  0 


(29) 
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However, 

Jt  J2 ,  •  •  •  J,  *  W(x)  <=►  x  *  Aj, «=» 4 j2,...Jn (x)  =  0  (30) 

In  addition,  note  that 

E  = 1  (31a) 

^^(x)  =  1  (31b) 

(x,y)es 

Having  (30)  and  (31), 

e=||y*-fT[/(x)||  (32a) 

~jy-  E  4,.Jn(X)%.-ln|  (32b) 

=  E  (32c) 

II 

=  |  E  4,A,(x'(y  bj,- j„)  |  (32d) 

<  .  max  ||(y*  Bj,  jj|  (32e) 

Now,  simplify  the  Right  Hand  Side  (RHS)  of  (32e)  as  follows: 

iuT*Jr  ~  Bj,""jnl1 
=jlrTJ(x)|y* -  E 

E/A,J„(x)(y*-y)| 


Note  that  (pjt  jn(x)  is  non-zero  for  x  e  A,-,  j2,  Jn.  Thus,  we  may  drive  the  following  result. 

x  e  =>  %,..j„(X)  >  0  (34) 

Let  jt, . . . ,  jn  e  H(x).  Thus,  we  may  rewrite  (34)  as  the  following: 

it,  ■  •  ■  Jn  e  H(X)  A  x  6  P(x)  =►  <ph . jn (x)  >  0  (35) 

Having  (35),  we  may  further  simplify  the  RHS  of  (32e)  as  follows: 

RHS  H  max  ||y*  -  y||  □  (36) 

Theorem  1  introduces  a  local  approximation  property  of  the  FTU  in  a  compact  form.  We  extend  this  local  property  to  a 
global  one,  thus  setting  a  bound  on  the  FTU  output  by  the  following  theorem: 

Theorem  2  (Approximation  Bound  of  FTU).  Suppose  that  fuzzy  sets  Aj  of  the  FTU  (6 )  are  normal,  complete  and  consistent  on  the 
data  set  S. 

i.  Let 

=jup  lljT-Bj,, ...jJ  (37) 

be  the  approximation  error  of  any  FTU  rule  (4)  for  any  arbitrary  input  enclosed  by  the  rule  premise,  i.e.,  x  e  Aju...ja,  where  y*  e  0  is 
the  corresponding  optimal  assignment  decision  vector  and  i=l . n,  j,  =  1, . . . ,  h,-.  Note  that  hi  is  the  grid  count  for  the  ith  var¬ 

iable.  The  approximation  bound  is  then 


BOUNDj  =  sup||y*  -  F7U(x)  || 
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(38a) 

(38b) 


ii.  Let 

P(x,y>  =  sup  ||y  -  y|| 

X6P(X) 

be  the  approximation  error  introduced  by  the  training  data  (x.  y)  e  S.  The  approximation  bound  is  then 
BOUND2  =  sup||y*  -  FTU{x)\\ 

=  :(max{p(x,>} 


(39) 

(40a) 

(40b) 


BOUNDi  <  BOUNDj  (41) 

Note  that  ||  II  is  the  2-norm  or  the  Euclidean  length  of  the  input. 


Proof  2.  Remember  the  Theorem  1  defining  the  FTU’s  approximation  mechanism  for  any  input  and  corresponding  decision, 
i.e.,  x  e  /  and  y*  e  0: 


e  =  \\r-FTU(x)\i 

(42a) 

<.„maX(x)||y-Bj,,...Jn||; 

(42b) 

<  max||y*  -  y 

(42c) 

where  <x,  y)  e  S. 

(i)  Note  that  we  may  write  the  following  fact  for  any  x  6  Aju... Jn : 

Itr  -  Bhl....,h„ll  <  x>sup  ||y*'  -  Bj,  jn  | 

(43) 

where  y*  and  y"  are  the  optimal  assignments  corresponding  to  the  inputs  x  and  x*,  respectively.  Having  (43),  we  may  extend 
(42b)  to  a  global  bound  as  follows: 

BOUND  =  sup||y*  -  F7U(x)|| 

<  supj^  mffiC(x)||y-Bhl,.Ai||| 

<  sup<  max  . . . 

•  ■  ■  {x  Sup  llr*  -Bj,,... 4 11 1| 

j-11} 

(ii)  For  any  x  e  P(x),  we  can  obtain  the  following  result: 

liy-yil  <  sup  liar  — n 

X>6P(X) 

(44) 

where  y*  and  y**  are  the  optimal  assignments  corresponding  to  the  inputs  x  and  x*,  respectively.  Having  (44),  we  may  extend 
(42c)  to  a  global  bound  as  follows: 
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39 


where  (x,y)  6  S. 

(iii)  is  obvious  by  the  (42).  II 

We  have  analyzed  the  approximation  mechanism  of  the  FTU,  and  can  now  check  the  approximation  accuracy  of  the  FTU. 
Note  that  the  FTU  output  does  not  necessarily  constitute  a  valid  decision  vector.  FCUs  map  the  FTU  output  to  a  valid  decision 
vector.  Therefore,  we  must  also  set  an  optimality  bound  for  the  FCUs  (17)  that  is  also  the  optimality  bound  of  the  FDM.  To  do 
so,  we  compare  the  FCU  output  with  the  optimal  decision  vector  through  the  cost  function  associated  with  the  minimization 
problem  (1)  as  follows: 

Theorem  3  (Optimality  Bound  of  the  FDM).  Suppose  that  the  fuzzy  sets  Cj.  (i  =  1 . m  and  jt  =  1 . gj  are  normal,  complete 

and  consistent  on  the  data  set  S'  (13 ).  Let  COST(- )  be  the  cost  function  associated  with  the  minimization  function  (1).  Let 

=  ugsup  |COST(y*)  -  COSr(Djl,...Jm)|  (45) 

be  the  approximation  error  for  any  arbitrary  FCU  layer  input  mapped  to  the  fuzzy  set  Cjl  hr  jm,  which  is  u  =  FTU(x)  e  CJtj2r_jm 
where  Chhjm  =  C,1,  x  C?  x  . . . .  x  Cj" .  Note  that  y*  e  0  is  the  optimal  assignment  decision  vector  for  the  input  xe  I.  The  optimal¬ 
ity  bound  is  then  s 

BOUND3  =  sup  |  COST  (y*)  -  COST  (FDM  (x))\ 

—  max^ej,,...^}  (46a) 

where  G  =  {jhj2, . . .  ,jm\ji  =  1,2 . gnh  =  1,2 . g2,  ■  ■  ■  ,jm  =  1,2,  ...,gm} 

Proof  3.  Because  Q,  j2i... jm  are  complete  and  consistent,  there  exists  at  least  one  Cjlh  jm  such  that  u  =  FTU(x)  e  Cj,  for 

any  x  g  /  where  jl,j2 . jm  e  G.  Moreover,  because  the  FCU  model  (17)  is  a  typical  fuzzy  classifier,  the  output  of  the  FCU 

corresponds  to  exactly  one  of  the  possible  assignment  decisions.  Thus,  we  gain  the  following  result: 
y  =  FCU(u)  e  {Dj,  jm [/, , . . .  ,jm  e  G(u)}  where  G(u)  =  {/],...  ,jm|u  e  Cjlp„jm;j t, . . .  ,jm  e  G}.  We  may  use  following  inequality 
as  the  approximation  mechanism  of  the  FDM,  as  defined  by  (6)  and  (17): 

|COST(y*)  -  COST(FDM(x))\  <  rnax^^l COST(y‘)  -  COST( Djijin)|  (47) 

for  any  xel  where  FDM(x)  =  FCU(FTU(x)).  Note  that  y*  is  the  optimal  assignment  corresponding  to  the  input  x.  From  (47),  the 
global  bound  for  the  FDM  will  be 

BOUND  =  sup | COST(y*)  -  COST(FDM(x))\  <  sup/  max  ...  |COST(y‘)  -  COST(Djl  jm)|} 

xel  xel  U,,..jmeC(u=™(x))  m  J 

<  max  |COST(y*)-COST(DJl,..Jm)|  □ 

h  r-JmSC 

Theorems  2  and  3  allow  for  checking  the  approximation  accuracy  on  a  given  data  set.  The  next  issue  is  guaranteeing  the 
optimality  of  the  FDM.  Note  that  the  optimal  decisions  are  all  in  the  approximation  bound  neighborhood  of  at  least  one  of  the 
fuzzy  sets  belonging  to  the  FCUs.  As  an  illustrative  example,  fuzzy  set  Cj,  is  shown  in  Fig.  2  with  u,.  u2  and  u3,  which  are 
any  input  for  the  FCUs  and  also  any  output  of  the  FTU.  Let  y) ,  y),  and  y(  be  the  optimal  assignments  corresponding  to  these 
inputs/outputs.  As  seen  from  Fig.  2,  the  optimal  assignments  are  in  the  approximation  bound  neighborhood  of  Cjt  j2  jm.  Note 
that  the  FCUs  map  any  input  falling  in  Cj,j2t...jm,  i.e.,  u  £  Cju... Jln,  to  Djl  Jm.  It  is  obvious  that  this  mapping  is  optimal  if  the  opti¬ 
mal  assignment  for  any  input  falling  in  the  fuzzy  set  Cjlj2i...Jm  is  exactly  equal  to  Djl  Jm  for  all  j'i,j2 jm  e  G. 

Based  on  the  rationale  provided  above,  we  can  guarantee  the  optimality  of  the  FDM  as  follows:  (i)  Originate  rule-bases 
through  normal,  complete  and  consistent  fuzzy  sets  for  both  the  FCU  and  FTU.  (ii)  Any  fuzzy  set  belonging  to  FCUs  (i.e., 
Cj,  j2,  jm  f°r  any  Ji.  •  •  •  Jm  £  G)  must  include  exactly  only  one  valid  decision,  (iii)  The  approximation  bound  of  the  FTU  must 
be  small  enough  to  ensure  that  there  exists  one  fuzzy  set  in  the  FCU  layer  that  includes  both  the  FTU  output  and  its  corre- 


Fig.  2.  The  fuzzy  set,  Cj,  on  the  variable 
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sponding  optimal  decision.  We  assert  that  these  three  statements  guarantee  an  optimally  trained  FDM  for  a  given  data  set. 
Note  that  it  is  easy  to  satisfy  statement  (ii)  with  small  rule-bases  for  the  FCU  layer  because  it  requires  only  (t+  l)™  rules. 
However,  statement  (iii)  coexists  with  complex  FTU  models,  denoting  large  rule-bases. 


6.  Simulations  and  results 

In  this  section,  we  analyze  and  discuss  the  performance  of  the  FDM  and  the  effectiveness  of  the  proposed  grid  partitioning 
methodology.  Because  the  primary  goal  of  this  work  is  to  show  the  applicability  of  fuzzy  decision  making  in  weapon  target 
assignment,  we  use  small-scale  scenarios  to  test  the  approach  (i.e.,  a  single  platform  perspective).  To  do  so,  we  consider  two 
different  small-scale  illustrative  examples  with  two  and  four  weapons.  As  a  third  example,  a  large-scale  scenario  with 
twenty  weapons  and  forty  targets  is  considered  to  show  the  scalability  of  the  approach.  Note  that  this  work  primarily  con¬ 
centrates  on  small-scale  WTA  instances  because  the  goal  is  to  show  the  possibility  of  applying  approximate  reasoning  theory 
to  WTA.  To  test  how  FDMs  perform  in  practice,  we  apply  a  K-fold  cross  validation  [  1 7  ].  In  K-fold  cross  validation,  the  original 
sample  set  is  randomly  partitioned  into  k  subsets,  k  -  1  subsets  are  used  in  training,  and  the  remaining  subset  is  set  aside  for 
validation.  The  cross-validation  process  is  then  repeated  k  times  such  that  each  subset  is  used  once  as  the  validation  set. 
Thus,  all  subsets  are  used  both  for  training  and  validation.  We  implement  the  models  and  perform  all  of  our  tests  with  MAT- 
LAB  7.0  on  a  PC  with  a  2.4  GHz  P4  processor  and  3  GB  of  RAM. 


Table  1 

Analysis  results  of  the  FDM  models  for  the  scenario  1. 

Solver  Theoretical 

Approx,  bound 

Fuzzy  Decision  Maker  (FDM) 

Standard  Grid  Partitioning 
Rule  Count  (FTU):  512 

Rule  Count  (FCU  Layer):  54  0.97 

Triangular  membership  functions 
Fuzzy  Decision  Maker  (FDM) 

Standard  Grid  Partitioning 
Rule  Count  (FTU):  512 

Rule  Count  (FCU  Layer):  54  0.97 

Trapezoid  membership  functions 
Fuzzy  Decision  Maker  (FDM) 

Standard  Grid  Partitioning 
Rule  Count  (FTU):  19,683 

Rule  Count  (FCU  Layer):  686  0.71 

Triangular  membership  functions 
Fuzzy  Decision  Maker  (FDM) 

Standard  Grid  Partitioning 
Rule  Count  (FTU):  19,683 

Rule  Count  (FCU  Layer):  686  0.63 

Trapezoid  membership  functions 
Fuzzy  Decision  Maker  (FDM) 

Grid  Partitioning  on  Sensitivity  Level 
Rule  Count  (FTU):  125 

Rule  Count  (FCU  Layer):  48  0.68 

Trapezoid  membership  functions 
Fuzzy  Decision  Maker  (FDM) 

Grid  Partitioning  on  Sensitivity  Level 
Rule  Count  (FTU):  2352 

Rule  Count  (FCU  Layer):  91  0.55 

Trapezoid  membership  functions 
Fuzzy  Decision  Maker  (FDM) 

Grid  Partitioning  on  Sensitivity  Level 
One  rule  per  valid  assignment  decision 
Rule  Count  (FTU):  108,864 

Rule  Count  (FCU  Layer):  8  0.03 

Trapezoid  membership  functions 
Branch  and  Bound  Algorithm  (BBA)  [  1  ] 

Genetic  Algorithm  (GA)  [20] 


0.57 


0.31 


Average  of  simulations 

Approx,  error  Optim.  error  Comp,  time  (s) 


0.74  0.05  <0.01 

(0.12)  (0.008)  (raO.O) 


0.75  0.04  <0.01 

(0.12)  (0.007)  (w  0.0) 


0.60  0.01  0.028 

(0.07)  (0.004)  («0.0) 


0.54  0.01  0.028 

(0.07)  (0.003)  (raO.O) 


0.62  0.05  <0.01 

(0.04)  (0.013)  (raO.O) 


0.47  0.01  <0.01 

(0.03)  (0.005)  (raO.O) 


<  0.01  0.0  0.078 

(w  0.0)  (0.0)  (t»Q.0) 

0.0  0.46 

(0.0)  (0.28) 

0.00  0.2 

(0.0)  (raO.O) 
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6.1.  Scenario  1:  w  =2  and  t  =  3 

We  define  a  simple  small-scale  scenario  involving  two  weapons  and  three  targets.  We  choose  an  illustrative  function  to 
evaluate  the  target  value  as  follows: 

vt  =  1  -  rt/rmax  (48) 

where  rf  e  [O.rmax]  is  the  range  of  the  target  i  =  1 . t  and  rmax  is  the  maximum  scenario  range.  We  choose  a  Gaussian 

function  to  characterize  the  weapon  kill  probability  as  follows: 

Py  =  fye  ci  (49) 

where  a,-  is  the  maximum  kill  probability  for  weapon  j  =  1 w,  bj  is  the  range  where  weapon  j  has  the  maximum  kill  prob¬ 
ability,  and  Cj  is  the  width  of  the  function  bell  (i.e.,  standard  deviation).  For  this  scenario,  we  set  the  maximum  kill  proba¬ 
bilities  as  ai  =  0.7  and  a2  =  0.4,  the  ranges  as  bi  =  5000  and  b2  =  20,000,  and  the  standard  deviations  as  Ci  =  5000  and 
c2  =  20,000. 

We  sample  104  instances  such  that  each  instance  corresponds  to  an  arbitrary  vector  of  target  ranges.  For  each  instance, 
we  evaluate  the  target  values  and  the  weapon  kill  probabilities  by  means  of  (48)  and  (49).  We  then  calculate  the  exact  deci¬ 
sions  dij  using  a  branch  and  bound  algorithm  for  a  particular  Vi  and  py.  Next,  we  generate  data  tuples  (x,y)  by  (2)  and  (3), 
finally  forming  training  data  set  S.  To  elaborate  the  principals  of  the  approach,  we  train  several  FDMs  by  varying  the  training 
parameters  the  rule  generation  method,  the  size  of  the  rule-base,  and  the  membership  function.  The  fuzzy  sets  are  all  nor¬ 
mal,  complete  and  consistent.  In  addition,  we  apply  the  method  described  in  Section  5  to  guarantee  the  optimality  of  the 
FDM.  We  also  implement  the  branch  and  bound  algorithm  (BBA)  [1  ]  and  the  genetic  algorithm  (GA)  [20]  for  comparison  pur¬ 
poses.  Thus,  the  performance  of  the  exact  solver,  the  suboptimal  solver  and  the  FDM  can  be  compared  according  to  both 
computation  time  and  optimization  error.  To  conduct  a  fair  comparison,  the  parameters  of  BBA  and  GA  are  taken  from  [1,20], 

To  evaluate  the  performance  of  the  FDM,  we  use  both  theoretical  and  simulated  results.  For  the  theoretical  analysis,  we 
compute  the  approximation  and  optimality  bounds  by  means  of  Theorems  2  and  3  and  report  the  results  in  Table  1  (the  ‘Ap¬ 
prox.  Bound’  and  ‘Optim.  Bound’  columns).  We  then  directly  compute  the  approximation  and  optimality  errors  within  the 
simulations,  where  we  compare  the  FDM  output  (i.e.,  y)  with  the  optimal  assignment  (i.e.,  y*).  To  evaluate  how  the  FDM  per¬ 
forms  in  practice,  we  apply  a  10-fold  cross  validation.  The  approximation  error  is  calculated  by  taking  the  norm  (2-norm)  of 
the  difference  between  the  optimal  assignment  and  the  FTU  output,  i.e.,  ||y*  -  u||  where  u  =  FTU(x).  The  optimality  error  is 
calculated  by  taking  the  difference  between  the  cost  of  the  optimal  assignment  and  the  cost  of  the  FDM  output  and  normal¬ 
izing  the  difference  by  the  cost  of  the  optimal  assignment.  That  is,  (COST( y)  -  COS71(y*))/COST{y*),  where  y  =  FCU(FTU(x)). 
Note  that  COST( •)  is  the  cost  function  associated  with  the  minimization  problem  (1 ).  We  report  the  simulation  results  in  Ta¬ 
ble  1  (the  ‘Approx.  Error’  and  the  ‘Optim.  Error’  columns).  In  addition,  standard  deviations  are  provided  in  parentheses  where 
applicable. 


Analysis  results  of  the  FDM  models  for  the  scenario  2. 


Approx,  bound 

Fuzzy  Decision  Maker  (FDM) 

Grid  Partitioning  on  Sensitivity  Level 
Rule  Count  (FTU):  1260 

Rule  Count  (FCU  Layer):  350  2.03 

Trapezoid  membership  functions 
Fuzzy  Decision  Maker  (FDM) 

Grid  Partitioning  on  Sensitivity  Level 
Rule  Count  (FTU):  110,592 

Rule  Count  (FCU  Layer):  732  1.83 

Trapezoid  membership  functions 
Fuzzy  Decision  Maker  (FDM) 

Grid  Partitioning  on  Sensitivity  Level 
One  rule  per  valid  assignment  decision 
Rule  Count  (FTU):  663,552 

Rule  Count  (FCU  Layer):  28  0.07 

Trapezoid  membership  functions 
Branch  and  Bound  Algorithm  (BBA)  [  1  ] 

Genetic  Algorithm  (GA)  [20] 


Approx,  er 


1.68  0.26  <0.01 

(0.11)  (0.052)  (raO.O) 


1.35  0.03  0.11 

(0.09)  (0.006)  (raO.O) 


<0.01  <0.001  0.48 

(raO.O)  (raO.O)  (raO.O) 

0.0  13.56 

(0.0)  (7.4) 

0.0  0.7 

(0.0)  (raO.l) 
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6.2.  Scenario  2:  in  =  4  and  t  =  6 

As  the  second  illustrative  example,  we  define  another  small-scale  scenario  involving  4  weapons  and  6  targets.  Similar  to 
the  previous  example,  we  characterize  the  target  values  and  the  weapon  kill  probabilities  by  (48)  and  (49).  We  set  ai  =  0.7, 
a2  =  0.8,  a3  =  0.8  and  a4  =  0.99;  fa,  =  50,000,  b2  =  15,000,  b3  =  15,000  and  b4  =  5000;  c,  =  3000,  c2  =  5000,  c3  =  5000  and 
c4  =  1 000.  In  defining  the  scenario,  we  form  a  data  set  with  1 04  data  tuples. 

With  this  data  set,  we  train  the  FDMs  in  a  similar  manner  to  the  previous  example.  We  report  the  theoretical  and  sim¬ 
ulated  results  in  Table  2. 

6.3.  Scenario  3:  w=  10  and  t=  10 

As  the  third  illustrative  example,  we  define  a  larger  scenario  to  demonstrate  the  scalability  of  the  approach.  In  this  case, 
the  weapon  count  is  10  and  the  target  count  is  (at  most)  10.  We  characterize  the  target  values  and  the  weapon  kill  proba¬ 
bilities  using  (48)  and  (49).  We  set  a,  =  0.7  for  j  =  1,  2,  3,  a,  =  0.8  for  j  =  4,  5,  6,  a}  =  0.8  for  j  =  7,  8,  aj  =  0.99  for  j  =  9,  10; 
bj  =  50,000  for  j  =  1 ,  2,  3,  bj  =  1 5,000  for  j  =  4,  5,  6,  bj  =  1 5,000  for  j  =  7,  8,  bj  =  5000  for  j  =  9,  10;  q  =  3000  for  j=  1,2,3, 
Cj  =  5000  for  j  =  4,  5,  6,  Cj  =  5000  for  j  =  7,  8,  Cj  =  1000  for  j  =  9, 10.  In  defining  the  scenario,  we  form  a  data  set  with  104  data 
tuples. 

With  this  data  set,  we  train  the  FDMs  in  a  manner  similar  to  the  previous  examples.  We  report  the  theoretical  and  sim¬ 
ulated  results  in  Table  3. 

6.4.  Discussion  of  results 

Tables  1-3  clearly  show  that  the  theoretical  results  agree  with  simulated  results  (in  approximation  and  optimality  er¬ 
rors).  The  approximation  bound  is  slightly  greater  than  the  approximation  error  in  each  case.  However,  the  optimality  bound 
is  significantly  larger  than  the  optimality  error.  These  results  show  that  the  theoretical  analysis  (of  the  approximation  and 
optimality  bounds)  is  valid.  Moreover,  we  can  state  that  while  the  approximation  bound  is  strong,  the  optimality  bound  is 
weak. 

We  can  train  optimal  FDMsas  seen  from  Tables  1  -3  by  applying  the  method  described  in  Section  5.  Therefore,  this  method 
appears  to  be  valid.  However,  it  forces  the  FTUs  to  use  complex  rule-bases,  resulting  in  excessive  memory  and  an  increase  in 
computation  time.  We  note  that  the  optimality  can  only  be  guaranteed  for  the  training  data  set.  It  is  not  expected  that  the 
optimality  error  is  zero  for  any  simulation  because  we  apply  a  K-fold  cross  validation. 

The  simulation  results  in  Tables  1  and  2  show  that  FDMs  with  moderate  or  large  rule-bases  have  satisfactory  approxima¬ 
tion  performances  compared  to  BBAs  and  GAs.  Furthermore,  the  FDMs  are  more  efficient  in  computation  time  compared  to 
the  other  solvers.  Because  it  is  possible  to  simultaneously  evaluate  (or  run)  rules  (i.e.,  concurrently)  for  fuzzy  systems,  the 
total  computation  time  is  theoretically  equal  to  the  computation  time  of  a  single  rule.  Moreover,  it  is  highly  likely  that  the 
computation  time  would  be  worse  for  BBAs  and  Gas,  as  the  cost  calculation  function  becomes  more  complex  because  these 


Table  3 

Analysis  results  of  the  FDM  models  for  scenario  3. 


Approx,  bound  Optim.  bound  error 

Fuzzy  Decision  Maker  (FDM) 

Grid  Partitioning  on  Sensitivity  Level 
Rule  Count  (FTU):  419,904 

Rule  Count  (FCU  Layer):  12,230  6.23  0.69 

Trapezoid  membership  functions 
Fuzzy  Decision  Maker  (FDM) 

Grid  Partitioning  on  Sensitivity  Level 
Rule  Count  (FTU):  6,336,000 

Rule  Count  (FCU  Layer):  51,159  4.20  0.27 

Trapezoid  membership  functions 
Fuzzy  Decision  Maker  (FDM) 

Grid  Partitioning  on  Sensitivity  Level 
One  rule  per  valid  assignment  decision 
Rule  Count  (FTU):  707,788,800 

Rule  Count  (FCU  Layer):  110  0.06  0.0 

Trapezoid  membership  functions 
Branch  and  Bound  Algorithm  (BBA)  [1] 

Genetic  Algorithm  (GA)  [20] 


Approx,  error 


3.54 

(0.22) 


2.85 

(0.22) 


Optim.  Comp,  time  (s) 


0.30  0.30 

(0.05)  (raO.O) 


0.01  4.53 

(0.006)  (raO.O) 


<0.001  512 

(raO.O)  (raO.O) 

0.0  399 

(0.0)  (204) 

0.0  3.2 

(0.0)  (0.4) 
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search-based  solvers  run  cost  functions  for  each  enumeration  (candid  decision).  The  complexity  of  the  cost  function  does  not 
affect  the  performance  of  the  FDM.  It  should  be  noted  that  the  solvers  are  all  implemented  in  MATLAB  on  a  PC;  it  is  quite 
possible  to  considerably  decrease  the  computation  time  by  implementing  the  solvers  on  more  sophisticated  platforms  with 
more  sophisticated  programming  tools.  Even  in  this  case,  the  FDM  still  would  be  more  efficient  in  terms  of  computation  time 
due  to  its  concurrent  processing  capability  and  its  independence  from  the  cost  function.  In  addition,  we  notice  from  Table  1 
that  the  membership  function  type  has  only  a  slight  effect  on  approximation  and  optimization  errors.  Therefore,  the  approx¬ 
imation  performance  of  the  FDMs  can  be  slightly  increased  by  optimizing  the  membership  functions. 

Based  on  Table  1,  it  is  evident  that  the  FDMs  trained  by  the  sensitivity-based  grid  partitioning  can  achieve  the  desired 
performance  with  smaller  rule-bases  compared  to  the  FDMs  trained  by  the  standard  grid  partitioning.  Therefore,  less  mem¬ 
ory  is  required  in  the  hardware  to  implement  the  FDM  with  sufficient  accuracy. 

Table  3  shows  that  the  FDMs  exhibit  satisfactory  approximation  performances  with  complex  (large)  rule-bases.  Although 
the  FDM  has  concurrent  processing  capabilities,  it  still  must  sum  a  large  number  of  outputs  (rule  outputs)  for  a  complex  rule- 
base,  thus  considerably  increasing  the  computation  time.  The  FDM  appears  to  be  scalable  (to  some  extent)  for  WTA.  How¬ 
ever,  the  simulation  results  above  show  that  FDMs  are  quite  satisfactory  for  small-scale  WTAs  where  both  weapon  and 
threat  counts  are  small.  The  FDM  can  clearly  be  used  as  a  decision  aid  system  to  help  commanders  for  small  scale  WTA. 

7.  Conclusions 

In  this  work,  we  propose  an  alternative  decision  aid  system  for  WTA  to  assist  commanders  on  the  battlefield.  The  concept 
defines  a  rule-based  mapping  through  approximate  reasoning  theory.  This  work  is  the  first  application  of  approximate  rea¬ 
soning  theory  to  WTA  by  applying  a  grid  partitioning  approach  to  generate  rule-bases.  In  addition  to  grid  partitioning,  we 
propose  grid  counts  by  considering  the  sensitivity  level  of  each  input  variable.  Moreover,  we  suggest  a  method  to  guarantee 
the  optimality.  The  trained  FDMs  show  satisfactory  performance,  particularly  for  small-scale  WTA. 

We  rely  on  other  solvers  to  provide  samples  while  training  the  FDMs,  meaning  that  once  the  problem  size  is  changed,  we 
must  reform  the  training  data  set.  To  shorten  the  computation  time  required  to  form  the  training  data  set  on  the  battlefield, 
we  propose  two  robust  alternatives.  The  first  (and  obvious)  alternative  is  to  form  several  training  sets  for  each  possible  sce¬ 
nario  and  train  one  FDM  for  each  training  set.  Once  the  problem  size  is  changed  for  the  first  alternative,  one  simply  uses  the 
appropriately  trained  FDM.  The  first  alternative  satisfies  the  optimality  but  may  require  a  large  number  of  stored  FDMs.  The 
second  alternative  is  to  train  only  one  FDM  (considering  the  maximum  possible  assignments)  such  that  we  form  a  single 
training  data  set  where  the  weapon  count  is  exactly  equal  to  the  number  of  weapons  on  the  platform  (and  the  target  count 
is  set  equal  to  the  weapon  count).  The  second  alternative  is  applied  as  follows:  When  the  real  target  count  (denoted  as  tr)  is 
greater  than  the  target  count  of  the  training  set  (denoted  as  t),  we  sort  the  real  targets  according  to  the  target  values  and  run 
the  FDM  for  the  first  t  targets.  When  the  real  target  count  is  less  than  the  target  count  of  the  training  set,  we  create  t  -  tr 
artificial  targets  with  zero  values  and  run  the  FDMs  for  tr  real  targets  and  t  -  tr  artificial  targets.  Though  the  second  alterna¬ 
tive  does  not  require  the  storage  of  a  large  number  of  FDMs,  it  does  not  satisfy  the  global  optimality. 

The  approach  appears  to  be  scalable  to  some  extent.  We  believe  that  the  reason  for  this  limitation  is  the  grid  partitioning 
method.  Suppose  that  an  assignment  problem  is  to  be  solved  for  w  weapons  against  maximum  t  targets.  When  we  divide 
each  input  variable  into  h  fuzzy  grids,  a  complete  rule-base  should  have  hn  fuzzy  rules  where  n  =  t  +  wt.  Obviously,  it  is  infea¬ 
sible  to  design  a  fuzzy  decision  maker  for  weapon  target  assignment  with  a  large  number  of  input  variables.  In  our  work,  we 
address  this  problem  by  choosing  the  grid  counts,  considering  the  sensitivity  level  of  each  input  variable.  However,  there  are 
a  large  number  of  more  sophisticated  rule  extraction  methods  in  the  existing  literature.  Future  works  could  apply  state-of- 
the-art  rule  extraction  methods  within  our  approach  to  reduce  the  number  of  rules,  thus  saving  computation  time. 

We  should  note  that  the  sensitivity  level  is  computed  on  a  given  data  set  by  approximating  the  underlying  function  to  a 
linear  continuous  function.  We  believe  that  this  idea  can  be  further  extended  by  choosing  other  functions  in  which  the 
underlying  function  is  approximated.  Moreover,  to  guarantee  the  optimality  in  this  work,  we  require  complex  FTUs  (i.e., 
large  rule-bases).  We  believe  that  the  theoretical  study  could  be  improved  to  achieve  better  approximation  and  optimality 
bounds,  which  may  lead  to  new  methods  to  guarantee  the  optimality. 
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